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Histone 3 lysine 4 trimethylation (H3K4me3) is known to be associated with transcriptionally active or 
poised genes and required for postnatal neurogenesis within the subventricular zone (SVZ) in the 
rodent model. Previous comparisons have shown significant correlation between baboon {Papio anubis) 
and human brain. In this study, we demonstrate that chromatin activation mark H3K4me3 is present in 
undifferentiated progenitor cells within the SVZ of adult baboon brain. To identify the targets and 
regulatory role of H3K4me3 within the baboon SVZ, we developed a technique to purify 
undifferentiated SVZ cells while preserving the endogenous nature without introducing culture artifact 
to maintain the in vivo chromatin state for genome-wide studies (ChlP-Seq and RNA-Seq). Overall, 
H3K4me3 is significantly enriched for genes involved in cell cycle, metabolism, protein synthesis, 
signaling pathways, and cancer mechanisms. Additionally, we found elevated levels of H3K4me3 in the 
MRI-dassified SVZ-associated Glioblastoma Multiforme (GBM), which has a transcriptional profile 
that reflects the H3K4me3 modifications in the undifferentiated progenitor cells of the baboon SVZ. 
Our findings highlight the importance of H3K4me3 in coordinating distinct networks and pathways for 
life-long neurogenesis, and suggest that subtypes of GBM could occur, at least in part, due to aberrant 
H3K4me3 epigenetic regulation. 



Brain regions retaining neural stem cells (NSCs) for postnatal and adult neurogenesis include the subven- 
tricular zone (SVZ) on the walls of the lateral ventricles and the subgranular zone within the dentate gyrus 
of the hippocampus' *. Within the SVZ, there is a population of slowly dividing NSCs with astrocyte -like 
morphology. These NSCs give rise to transit-amplifying cells, which subsequently give rise to immature neuro- 
blasts. In the rodent, neuroblasts ultimately migrate through the rostral migratory stream to generate interneur- 
ons in the olfactory bulb^ Numerous studies have demonstrated that extracellular signals'"'" intracellular 
regulators''''^''\ and epigenetic mechanisms' have significant effects on the self-renewal and lineage commitment 
of NSCs. An epigenetic mechanism through MLLl (mixed-lineage leukaemia 1, a histone methyl transferase 
responsible for histone 3 lysine 4 trimethylation-H3K4me3) has been shown to induce postnatal neurogenesis in 
SVZ'\ The MLL family methyltransferases (MLLl , MLL2, MLL3, MLL4, MLL5) reside within the Trithorax (Trx) 
group protein complex, which also contains WDR5/RbBP5/ASH2 subunits to convert the demethylated forms of 
H3K4 to the trimethylated form in order to render open chromatin'^"". To date, the emerging consensus from 
genome- wide scale analysis reveals that tri-methylation at H3K4 (H3K4me3) is characteristic of transcriptionally 
active or poised status""*. Importantly, the H3K4me3/MLL family is also known to be associated with distinct 
types of hematologic disorders'"'""'. Thus, oncogenesis under MLL deregulation could result in changes of gene 
expression through the alteration of H3K4me3 epigenetic landscape. 

In this work, we demonstrate that H3K4me3 is present in progenitor cells residing in the primate SVZ, where 
adult neurogenesis occurs. To uncover the function of H3K4me3 in this adult germinal niche, we have chosen to 
utilize the nonhuman primate- baboon {Papio anubis) for genome-wide ChlP-Seq (chromatin immunoprecipi- 
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Figure 1 | Gross anatomy of baboon SVZ architecture and the colocalization of H3K4me3 and cell type specific markers in baboon SVZ. (A) Scheme 
displays coronal view of baboon brain with dictation of lateral ventricle (LV) and subventricular zone (SVZ). (B) Coronal sHce of baboon forebrain showing 
regions sampled, the arrow points to the head of the caudate nucleus at the margin with the lateral ventricle. The dorsal is to the left and medial is at the 
bottom for orientation. 2X magnification. (C) Horizontal slice through the caudate nucleus showing the subventricular zone (SVZ) in temporal horn of 
lateral ventricle; arrow heads point to SVZ; 6X magnification. (D) Images of the caudal ventral subventricular zone reveal the GFAP-positive astrocytic 
ribbon. A deeper z-stack imaging was performed in order to view GFAP-positive processes of the NSCs extending toward the lateral ventricle (LV). 
(E) Vimentin staining shows NSC population in the baboon SVZ. (F) PSA-NCAM staining shows neuroblast population in the baboon SVZ. (G) A small 
population of H3K4me3 positive cells is colocalized with GFAP in rostral ventral SVZ. (H) Significant population of Vimentin-positive cells is colocalized 
with H3K4me3. (I) H3K4me3 persists in the PSA-NCAM positive neuroblast population. (D-I) 40X images; inserts are lOOX magnification; White arrow 
heads in G and H indicate colocalization of H3K4me3 and cell type markers; (G-I) Images and inserts are 2um single sUces. Scale bars present 20 [im. 



tation and deep sequencing) and RNA-Seq analyses. We developed 
a technique to purify progenitor cell populations within the SVZ 
directly from baboon brain, in which the large brain mass allows 
us to isolate sufficient SVZ cells without culture artifacts. We 
identified that H3K4me3 is significantly enriched for genes 
involved in cell cycle, cell signaling, nervous system development, 
metabolism, and ribosomal biogenesis. Among these genes, RNA- 
Seq identified that genes associated with cellular signaling/main- 
tenance, DNA replication, metabolism, and protein synthesis are 
expressed. Our findings highlight that H3K4me3 is critical for SVZ 
cellular activity of life-long neurogenesis. Importantly, we found 
that subtypes of SVZ-associated Glioblastoma Multiforme (GBM) 
have increased levels of H3K4me3, in which changes in subsets of 
gene expression under aberrant H3K4me3 activation were also 
identified in these two subtypes of GBM (group I and II GBM). 
We anticipate that aberrant regulation by H3K4me3 in this adult 
germinal niche may potentiate tumorigenesis in the context of 
neurogenesis as this supports the growing conviction of epigenetic 
alterations in malignancy^''^^. 



Results 

Chromatin activation mark H3K4me3 in primate progenitor cells 
within adult SVZ. We utilized the previously characterized NSCs and 
neuroblasts markers^"" including GFAP, Vimentin, and PSA-NCAM 
for gross anatomical examination of the SVZ in non-human primate- 
baboon (Fig. 1). We found that GFAP-positive astrocytic ribbon lines 
this niche and some GFAP-positive cells extend toward the ventricle 
(Fig. ID), resembling the previous finding in adult human brain^^"^^. 
Both Vimentin (NSC marker) and PSA-NCAM (neuroblast marker) 
also label the SVZ cells in the baboon brain (Fig. IE, F). To examine 
which cell population in the baboon SVZ is enriched with H3K4me3, 
we applied co-immunostaining for H3K4me3 and cell type specific 
markers- GFAP, Vimentin, and PSA-NCAM representing NSCs and 
neuroblasts, respectively. We found that H3K4me3 is associated with 
small population of GFAP-positive NSCs, colocalized with Vimentin- 
positive cells, and persistent in PSA-NCAM-positive neuroblasts 
(Fig. IG-I). Given the significant correlation of brain architecture 
and genomic level between baboon and human^""'^^, we reason 
findings in baboon are relevant to human. In the parallel experi- 
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ment, we also found that H3K4me3 is present in the adult mouse 
SVZ co-localizing with GFAP, Mashl, and doublecortin (DCX)/ 
PSA-NCAM representing NSCs, transit- amplifying progenitors, 
and neuroblasts, respectively (Supplementary Fig. 1). We per- 
formed a 2-hour EdU administration in mice to label quickly 
dividing transit- amplifying cells and neuroblasts, and found colocali- 
zation of EdU and H3K4me3 (Supplementary Fig. ID). Further, we 
performed a cytosine-P-D-arabinofuranoside (AraC) ablation to 
validate the H3K4me3 presence in the SVZ. Antimitotic treatment 
via the infusion of AraC into adult mouse brain eliminates the 
quickly dividing transit amplifying cells and neuroblasts in the 
SVZ but leaves slowly dividing stem cells unaffected^"'^'^. Notably, 
the population of H3K4me3 positive cells was significantly de- 
creased (Supplementary Fig. 1), indicating that H3K4me3 is 
associated with transit amplifying cells and neuroblasts in addition 
to NSCs. Our results reveal that previously characterized markers for 
distinct SVZ ceUs in the rodent and human brain can also represent 
specific cell types in the baboon SVZ, and that H3K4me3 is present 
in the undifferentiated SVZ cells consistent across species. 

H3K4me3 epigenetic landscape in primate progenitor cells within 
adult SVZ. Given the presence of H3K4me3 in the baboon SVZ cells, 
the molecular targets of this chromatin active mark in adult SVZ 
remain to be determined. To this end, we sought to identify the 
genomic loci that carry the H3K4me3 modification in the SVZ 
cells of the baboon, which share 92% sequence homology to 
humans at the genomic level. Additionally, comparisons across 
primate species and human have shown significant correlation in 
terms of brain volume and structure between baboon and 
human^*"'^^. However, the spatial location and complexity of cell 
types in SVZ limit the dissection of discrete populations of cells 
within this niche for analysis. Nevertheless, the in vitro cell culture 
system cannot recapitulate in vivo cellular and molecular 
mechanisms that regulate identity and plasticity of the SVZ cells. 
To overcome this limitation and to ascertain characteristics of the 
SVZ cells as they exist in vivo, we developed a technique to purify 
SVZ cells directly from baboon brain within a short post-mortem 
interval (<20 minutes). After SVZ dissection, dissociated cells were 
analyzed by flow cytometry that showed —8% GFAP-positive (NSC 
marker) and 15% DCX-positive (early and migrating neuroblast 
marker) without mature neuron contamination from nearby 
caudate (Fig. 2A). For purification, we manually conjugated 
Dynabeads to antibodies against SVZ cell type-specific markers, 
and then used the antibody-conjugated Dynabeads to purify 
dissociated cells following dissection of the SVZ. During 
developing this purification technique, we ran flow cytometry to 
validate the recovery of NSC and neuroblast cells. In the 
population purified by Vimentin-conjuagated Dynabeads, 59.8% of 
eluted cells expressed GFAP, and there was no DCX* cells 
(neuroblast marker) in this population. Likewise, 56.5% Vimentin- 
positive cells were recovered from GFAP-Dynabeads purification 
(Fig. 2C). These are not surprising results since GFAP labels both 
quiescent and active NSCs and Vimentin labels relatively active 
NSCs. As DCX labels both early and migrating neuroblasts and 
PSA-NCAM labels migrating neuroblasts, Dynabeads-DCX 
recovered 63% PSA-NCAM* population (Fig. 2D). To ensure no 
contamination of matured cells from the neighboring striatum, we 
also examine the presence of GAD65, which labels the mature 
neurons. We found no detectable level of GAD65 by flow 
cytometry. Our technical innovation preserves the in vivo nature of 
undifferentiated SVZ cells and is ideal for uncovering H3K4me3 
enriched loci through the genome-wide ChlP-Seq approach. Each 
set of purified NSCs, neuroblasts, or combined progenitor cells 
(including NSCs and neuroblasts) from SVZ were subjected to 
Chip with an antibody specific for H3K4me3 and sequenced to 
high depth (Fig. 3A). The baboon gene annotation is not currently 



available, however, there is only 2% difference at genomic level 
between baboon and rhesus macaque. Thus, the resulting reads 
were alternatively compiled, processed, and aligned to the Jan. 
2006 rhesus macaque (Macaca mulatta) draft assembly, 
Mmul_051212'° (http://www.ncbi.nlm.nih.gov/genome/assembly/ 
237568/) and the UCSC genome browser version: rheMac2. 
Hotspot was used to identify H3K4me3-enriched loci (FDR = 
0.01)"". Subsequently, H3K4me3-enriched genes were identified to 
be associated to their closest RefSeq gene and putative transcription 
start sites (TSS) by using the closest-features program from the 
BEDOPS tool set'" (Fig. 3B-F; Supplementary Table 1). Based on 
rhesus macaque annotation, the enrichment of H3K4me3 located 
less than 1 kb away from annotated TSS is approximately 70% of 
H3K4me3 target genes, and —8% of H3K4me3-enriched regions are 
>10 kb away from annotated TSS. Therefore, the distribution of 
H3K4me3 enrichment is preferentially close to TSS in the 
undifferentiated SVZ cells. Gene ontology (GO) analysis for the 
pool of total progenitor cells including NSCs and neuroblasts 
reveals that molecular functions of these genes include cell cycle, 
cell signaling, development, metabolism, neuronal differentiation, 
and ribosomal biogenesis (Fig. 3G and Supplementary Table 1). In 
addition, the top biological networks predicted by Ingenuity 
Pathways Analysis (IP A) are involved in 1) nervous system 
development and function; 2) post-transcriptional and post- 
translational modifications; 3) molecular transport; 4) ribosomal 
biogenesis; 5) metabolism (Fig. 3G and Supplementary Fig. 2-5). It 
is of interest to distinguish the differential regulation by H3K4me3 in 
NSC- versus neuroblast-specific populations. We therefore 
conducted 2 additional ChlP-Seq experiments for purified NSCs 
and neuroblasts separately. Importantly, the overrepresented genes 
in the purified NSC population are significantly enriched in 
categories of cell cycle, cell death, DNA repair, and proliferation, 
whereas the statistical over-representation among genes in the 
purified neuroblast population is highly associated with ribosomal 
biogenesis (Supplementary Fig. 6 and Supplementary Table 1). 
Among the shared targets in both NSCs and neuroblasts, 
H3K4me3 is enriched for genes in chromatin modification, 
neuronal function, and signaling (Supplementary Fig. 6). Our data 
highlights that the interplay among these biological networks and 
pathways coordinates adult neurogenesis through H3K4me3 
regulation. Intriguingly, the top canonical pathway by IPA 
prediction from the purified NSCs and combined populations 
(NSCs and neuroblasts) is "molecular mechanisms of cancer" (p- 
value: 4.05E-08; Supplementary Fig. 4), in addition to mitochondria 
function and several signaling pathways, such as eIF4-, RAR-, and 
Ubiquitin- pathways (Supplementary Fig. 5 and Supplementary 
Table 1). Although MLL/H3K4me3 overexpression was known to 
be connected to hematologic diseases our finding is the first 
implication of oncogenic pathways in the SVZ through H3K4me3 
regulation. 

While H3K4me3 is characteristic of transcriptionally active status 
in differentiated cells, in embryonic stem cells (ESCs), many genes 
harboring H3K4me3 are poised for timely activation upon proper 
differentiation signals"''". Perhaps, the undifferentiated SVZ cells 
underlying H3K4me3 regulation during adult neurogenesis is remin- 
iscent of similar manner in ESCs. Herein, we seek to determine the 
functional consequences of H3K4me3 enrichment in terms of 
expression of its target genes. Through RNA-Seq analysis for undif- 
ferentiated SVZ cells purified from baboon brain (Fig. 3A), we found 
that 72% of H3K4me3 enriched genes in undifferentiated SVZ cells 
are expressed (1913 genes are detectable by RNA-Seq; 2663 genes 
enriched with H3K4me3 by ChlP-Seq), in that genes are significantly 
associated with cell signaling, cellular morphology and maintenance, 
DNA replication and repair, metabolism, and protein synthesis 
(Fig. 3H). Our results suggest that H3K4me3 plays potential roles 
for the integrity of undifferentiated SVZ cells while neurogenesis 
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Figure 2 | Flow qftometry analyzes the purity of SVZ cells after dissection and Dynabeads/magnetic isolation. (A) After SVZ dissection, dissociated cells 
were analyzed by flow cytometry that showed —8% GFAP-positive (quiescent NSC marker) and 15% DCX-positive (early and migrating neuroblast 
marker) without contamination of mature cells from nearby caudate (as mature neuron markers NeuN and GAD65 are negative). (B-D) During 
developing this purification technique, we ran flow cytometry to validate the recovery of NSC and neuroblast cells. (B) For NSCs purification, Vimentin is 
also a NSC marker, and Dynabeads-Vimentin recovered 59.8% GFAP* population, but no DCX* cells. (C) 56.5% Vimentin-positive cells were recovered 
from GFAP-Dynabeads purification, supporting the notion that GFAP labels quiescent and active NSCs and Vimentin labels relatively active NSCs. 
(D) As DCX labels both early and migrating neuroblasts, Dynabeads-DCX recovered 63% PSA-NCAM* population (PSA-NCAM labels migrating 
neuroblast), but no GAD65* cells (GAD65 represents mature neuron marker). Our dissection and purification yield purely undifferentiated SVZ cells 
without mature cell contamination. 



proceeds throughout life time. On the other hand, 750 out of total 
2663 H3K4me3-enriched genes are not detectable by RNA-Seq, sug- 
gesting 28% of genes in SVZ are poised for activation (Fig. 3H). 



Lastly, our RNA-Seq identified 1 1099 genes are expressed in baboon 
SVZ, whereas only 1913 out of 11099 of genes in undifferentiated 
SVZ cells are H3K4me3-enriched and also detectable by RNA-Seq 



SCIENTIFIC REPORTS | 4:5371 | DOI: 1 0. 1 038/srep05371 



4 



© © 




Anti-cell type specific antibody 
conjugated Dynabeads® 



Micro-dissect and Purify via anti-cell 
dissociate cells type marker conjugated 

from SVZ Dynabeads® 



ChlP-Seq 



© © 



© 



Elution of cells 
and shearing 



aH3K4me3 Antibcxiy 
conjugated Dynabeads® 



lllumina HiSeq2000 
Deep Sequencing 
and analysis 



RNA-Seq 



Chromatin 
Immunoprecipitation 
and DNA purification 



B 



chi«: 105.352.000 1 



H3K4nie3 signal 
H3K4iiie3 peaks 



1 T 



SVZ 



1 kbh- 



chM5: 09,130.000 | 0S.13S.000 | 

C RPL12 I |,»...>|..|,»>m 4^ 

H3K4nie3 signal 



D 



H3K4me3 peaks 

ehrl3: 116.015.0001 



■ I I I 




2hb|— 



H 



.^1 iL 



H3K4me3 signal 
H3K4me3 peaks 



chf6: 151.310.000 I 

^ MRPL22^ 

H3K4me3 signal 



I ■ 



ChlP-Seq 



n U it . J 1 . J . 

1 1 II I 



151.320.0001 



H3K4iTie3 peaks 

Chris: 15.040.000 j 

p RBBP8 f))l»)))»)))))l) < 
H3K4me3 signal 



101(b 1- 



H3K«ne3 peaks 



^ I - i - - - - l-li— 



15.950.0001 



5 kb t— 



H3K4me3 enriched for networks and 
pattiways in: 

- Nervous system development and 
function 

- IVIolecular transport 

- Ribosomal biogenesis 

- Metabolism 

- RNA post-transcriptionai modification 

- RAR signaling 

- Pattiways in mitoctiondria function 

- Protein ubiquination 

- Cell cycle/death/proliferation 

- Pattiway to molecular mechanisms of 
cancer 



RNA-Seq 




■ Celi signaling 

- Celiular morphoiogy & maintenance 

■ DNA replication/repair 

■ Metabolism 

■ Protein synthesis 



Figure 3 | ChlP-Seq and RNA-Seq analyses for in vivo SVZ cells from baboon brain. (A) Scheme displays purification of the in vivo SVX cells for 
ChlP-Seq and RNA-Seq. For ChlP experiments, DCX- and PSA-NCAM-Dynabeads were used for neuroblast purification, GFAP- and Vimentin- 
Dynabeads were used for NSC purification. For the pool of undifferentiated SVZ cells, Dynabeads conjugated-GFAP, -Vimentin, -Nestin, -DCX, and 
-PSANCAM were used for purification. (B-F) Peak views of representative genes enriched with H3K4me3 in baboon SVZ cells. The arrows indicate the 
annotated transcription start sites (TSS). The H3K4me3 signal is "mapped read density" after normalization to unmodified H3. The H3K4me3 peaks 
were called at FDRO.Ol, which is derived from a comparison of mapped read enrichment relative to a local background model based on the binomial 
distribution. (G) For characterization of H3K4me3 enriched genes in baboon SVZ cells, IPA prediction reveals the top canonical pathways and networks 
under H3K4me3 regulation in the undifferentiated SVZ cells including NSCs and neuroblasts. (H) RNA-Seq analysis uncovers 11099 genes in the 
undifferentiated SVZ cells are detectable at transcriptional level, while 1913 expressed genes are enriched with H3K4me3 as noted "magenta" portion. 



(Fig. 3H). As the notion of many other factors involved in gene 
activation, such as opened chromatin through histone acetylation, 
it is not a surprising result that only —20% of H3K4me3-enriched 
genes in undifferentiated SVZ cells are expressed. 

Potential link between aberrant H3K4me3 activation and 
tumorigenesis in subtypes of GBM. The SVZ is a suspected origin 
for particular types of brain tumor""^'^. In support of this notion, 
MRI-classified Glioblastoma Multiforme (GBM) has revealed that 
group I GBM is arisen from SVZ and group II GBM is associated with 
SVZ with migratory characteristics'*". Although in this study, we did 
not directly address whether group I and II GBM arise from SVZ, our 
previous findings through proteomic approach identified that 
markers representing progenitor cells in SVZ including GFAP, 
Vimentin, and DCX were highly expressed in these GBM tumors'". 
Collectively, i) the close relationships between subtypes of GBM and 
SVZ; ii) the presence of H3K4me3 in progenitor cells within adult 



SVZ (Fig. 1); iii) a set of H3K4me3-enriched genes are involved in 
cancer pathways (Supplementary Fig. 4); and iv) the known 
association of MLL/H3K4me3 with neoplasia'*^-''^ prompted us to 
examine whether the enrichment of H3K4me3 is linked to SVZ- 
associated GBM. To this end, specimens from MRI-characterized 
group I and II GBM as well as control regions (Fig. 4A) were 
analyzed for the relative abundance of H3K4me3 by western blot. 
The H3K4me3 levels were increased in both group I and II GBM 
compared to correlated normal brain regions (Fig. 4B). As the 
pathological consequences of excessive enzymes involved in 
chromatin modifications were recognized to be harmful, we then 
examined the expression level of MLL family methyltransferases 
(MLLl, MLL2, MLL3, MLL4, MLL5) and subunits (Ash2, RbBPS, 
WDR5) within the core complex responsible for trimethylation of 
H3K4. There was no difference of MLL3-5 or subunits between the 
GBM and control specimens, whereas the expressions of MLLland 
MLL2 at both transcript and protein levels were increased in group I 
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Figure 4 | The levels of H3K4me3, MLLl, MLL2, and subset of H3K4me3-enriched genes are elevated in group I and group II GBM. (A) Scheme 
displays the control regions andMRI scanned group I and group II GBM. (B) The levels of H3K4me3 were analyzed by western blot. Total histone 3 (H3) 
were used as loading control. (C) A heatmap for expression of genes in MLL complex (log2 scale) by quantitative RT-PCR for human GBM specimens. 
(D) Analysis of MLLl and MLL2 protein levels and H3K4me3 abundance in GBM and control specimens by western blot, y-tubulin and unmodified H3 
and H4 were used as loading controls. (E) A heatmap for expression of selected 49 genes by quantitative RT-PCR for human GBM specimens. Inset shows 
color scale indicating differences in expression level. The white color with symmetry to 0 defines no difference at expression level between GBM and the 
controls. The red color scales increased expression level of genes in GBM compared to control and the blue color scales decreased expression in GBM 
relative to control. The dendrogram was computed by hierarchical clustering with Euclidean distance metric and complete linkage. (F) Venn diagram 
displays the overlap among genes enriched with H3K4me3 identified from ChlP-Seq of SVZ cells (NSCs and NBs), expressed in SVZ cells identified from 
RNA-Seq of baboon SVZ cells as noted "magenta" portion (n = 1913), and substantially increased in SVZ-associated GBM cases identified from 
RNA-Seq analyses (yellow- group I GBM; light purple- group II GBM). The GO terms for overrepresented genes among this overlap are shown in the 
"box". (G) Venn diagram displays the overlap among genes which are not detectable in baboon SVZ cells by RNA-Seq as noted "green" portion 
(n = 750), but are enriched with H3K4me3 identified from ChlP-Seq of baboon SVZ cells (NSCs and NBs), and are substantially increased in SVZ- 
associated GBM cases by RNA-Seq analyses (yellow- group I GBM; light purple- group II GBM). The GO terms for overrepresented genes among this 
overlap are shown in the "box". 



and II GBM compared to control (Fig. 4C, D). Herein, we envision 
the elevated level of H3K4me3 in GBM could be due to the up- 
regulation of MLLl and MLL2. In light of the "molecular 



mechanisms of cancer" as the top canonical pathway from our 
H3K4me3 ChlP-Seq data raises the possibility that aberrant 
H3K4me3 landscape may perturb SVZ lineages and contribute to 
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tumorigenesis. To examine whether genes enriched by H3K4me3 in 
the primate SVZ cells were consequently affected in the group I and II 
GBM specimens, we applied RNA-Seq for specimens obtained from 
these types of GBM and control brain region (Fig. 4A) to delineate 
the differential gene expression in group I and II GBM at the 
genome-wide scale. Of note, the overlap between these 2 types of 
GBM includes genes functioning in chromatin remodeling, cell cycle, 
metabolism, signaling, and ribogenesis. We validated RNA-Seq data 
by qRT-PCR for selective genes from these categories. After 
normalization to control brain regions, several genes involved in 
chromatin modification, ribosomal biogenesis, and metabolism 
were increased in both group I and II GBM (Fig. 4E), resembling 
our previous proteomic results'". Consistent with RNA-Seq results, a 
number of genes were up-regulated specifically in either group I or II 
GBM by qRT-PCR validation. These differential expression 
signatures may be useful for distinguishing the molecular charac- 
teristics between group I and II GBM. H3K4me3 is substantially 
increased in group I and II GBM (Fig. 4B and D), implying that 
these tumors may have a transcriptional profile reflecting the 
H3K4me3 enrichment in progenitor cells of the SVZ. To further 
examine this relationship, we compared the overlap of RNA-Seq 
data between SVZ-associated GBM and undifferentiated SVZ cells 
from baboon brain (Supplementary Table 2). We found that 404 
genes detectable by RNA-Seq in SVZ and enriched with H3K4me3 
were aberrantly overexpressed (log2 a 2) in group I and/or group II 
GBM, respectively (Fig. 4F and Supplementary Table 2). These genes 
are significantly associated with cell signaling, cellular morphology 
and maintenance, DNA replication and repair. Additionally, 49 
genes involved in cell signaling and metabolism are expressed and 
enriched with H3K4me3 in undifferentiated SVZ cells, and are also 
upregulated in both group I and group II GBM (Fig. 4F). Intriguingly, 
a set of H3K4me3-enriched genes in undifferentiated baboon SVZ 
cells are not detectable by RNA-Seq, but are overexpressed in GBM 
(n = 85) (Fig. 4G and Supplementary Table 2). These 85 genes 
predominantly function in cell cycle, cellular growth and prolifera- 
tion, and immune response, as such reflects to the profound effect of 
aberrant H3K4me3 activation that causes proliferating cancer cells. 
Perhaps, these mitotic and proliferation signatures would be a better 
tracing of cancer cells within the surrounding SVZ cells. 

Discussion 

To glean previously unattainable information regarding the epige- 
netic regulation of H3K4me3 in the in vivo progenitor cells within the 
primate SVZ, we undertook the genome-wide ChlP-Seq and RNA- 
Seq approaches for purified SVZ cells from baboon brain. Using 
ChlP-Seq, a cohort of H3K4me3-enriched genes was identified, 
which function in chromatin modification, cell cycle, cell signaling, 
metabolism, nervous system differentiation, ribosomal biogenesis, 
and protein synthesis. As H3K4me3 has been linked to poised or 
active gene expression''''"'"', our RNA-Seq analysis reveals the 
detectable expression from a subset of H3K4me3-enriched genes that 
contribute to cellular signaling, DNA replication, metabolism, and 
protein synthesis. Our findings illustrate the importance of 
H3K4me3 in coordinating cell fates, metabolic sensing, and signaling 
pathways for life-long neurogenesis. It is worth noticing in the inde- 
pendent set of ChlP-Seq for the purified NSCs that H3K4me3 is 
preferentially enriched for genes involved in cell cycle and DNA 
replication/repair, implicating that genomic integrity in NSCs is 
maintained under H3K4me3 regulation. Further, we showed that 
H3K4me3 is predominantly enriched for genes responsible for ribo- 
genesis in the purified neuroblast population. Remarkably, a sub- 
stantial number of H3K4me3-enriched genes in both NSCs and 
neuroblasts are linked to cancer pathways. 

While MLL/H3K4me3 functions in adult neurogenesis, the sus- 
picion of SVZ as the origin for subtypes of brain tumors^"-'"' raises the 
concern that, similar to hematologic cancer''^'^", deregulation of this 



epigenetic mark could cause cancer associated with this germinal 
niche. We found elevated levels of H3K4me3 in MRI-classified group 

I and II GBM specimens, reminiscent of a recent study of chromatin 
status mapping, in that enriched occupancy of H3K4me3 was found 
in established GBM cell lines'*'*. Although previous studies have iden- 
tified mutations and the resulting abnormal expression (eg. p53, 
IDHl, IDH2) associated with distinct types of GBM""" ''^ our RNA- 
Seq result showed that the expression of those genes were not affec- 
ted, suggesting the SVZ-associated group I and II GBM is distinct 
from other characterized GBM cases. Further integrated analyses 
across ChlP-Seq (baboon SVZ cells) and RNA-Seq (baboon SVZ 
cells; group I GBM; group II GBM) data revealed that the expression 
level of 16.7% of H3K4me3-enriched genes is elevated in group I and 

II GBM, regardless whether the transcripts are detectable or unde- 
tectable in the in vivo baboon SVZ cells. Among those genes, a 
significant overrepresentation of GO terms is related to cell cycle, 
proliferation, and signaling for immune responses, which imply the 
most profound effect of H3K4me3 in these SVZ-associated GBM. 
Our findings also suggest that other factors existing in tumor micro- 
environment could initiate or enhance tumorigenesis in addition to 
aberrant accumulation of H3K4me3. However, changes in subsets of 
gene expression under aberrant H3K4me3 activation may, at least in 
part, influence cell fate in the SVZ and disrupt the maintenance of 
neurogenesis that ultimately contribute to tumorogenesis in subtypes 
of GBM. Apparently, there is a fine line that H3K4me3 must walk 
between adult neurogenesis and oncogenesis in the SVZ. Thus, 
future studies to understand the mechanism underlying the fine 
tuning regulation of H3K4me3 in SVZ cells remains to be 
determined. 

Methods 

AU animal experiments were approved by the guidelines of the Institutional Animal 
Care and Use Committee of the University of Texas at San Antonio (UTSA) and 
Texas Biomedical research Institute/Southwest National Primate Research Center 
(SNPRC) at San Antonio. The experimental materials involving human specimens 
were approved by the Institutional Review Board (IRB) before starting research. 

Cell types purification for ChlP-Seq analysis. We manually conjugated antibody 
against cell type markers, such as GFAP, Nestin, Vimentin, PSA-NCAM, or 
Doublecortin to Dynabeads. We then used the Dynabeads-conjugated antibody to 
purify cells dissociated from SVZ microdissection. Briefly, cells from fresh dissected 
baboon SVZ were immediately dissociated with Accutase, equilibrated in binding 
buffer containing phosphate-buffered saline (PBS), 0.05% TritonX-100 (or saponin, 
detergent choice depends upon antibody), and subsequently subjected to Dynabeads- 
conjugated antibody purification. After elution with high salt and pH-gradient buffer, 
the purified populations were crosslinked in 1.1% formaldehyde before chromatin 
shearing by Diagenode Bioruptor. The resulting sheared chromatin fragments in a 
size range between 200 to 500 base pairs were incubated with H3K4me3 antibody- 
conjugated Protein A Dynabeads (MiUipore #07-473,1 : 1000; Active Motif #39160, 
1 : 500; Life Technology Dynabeads protein A) overnight. For normalization, the 
aliquot of sheared chromatin fragments were incubated with antibody against total 
histone 3-conjugated Protein A Dynabeads (unmodified H3 antibody, MiUipore #05- 
499; 1 : 1000). Subsequently, enriched chromatin fragments were eluted, subjected to 
de-crosslink, purified for library preparation (Illumina Library Kit), and deep- 
sequencing with 200 million tags through lUumina HiSeq2000 sequencer. 

Sequence alignment and peak calling. The Rhesus macaque gene annotation was 
constructed and is maintained at the UCSC and is derived directly from the NCBI 
RefSeq project (http://nar.oxfordjournals.org/content/33/suppl_l/D501.full). 
Alignments were generated by using the Bowtie alignment program version 0.12.7., 
with a maximum of 2 mismatches allowed in the retained uniquely mapping reads. 
Aligned read enrichments where detected using the Hotspot peak finding program 
that is based upon a binomial distribution model, with built-in, automatic 
background adjustment. Hotspot to RefSeq gene associations were established in a 
distance ranked manner by using the closest-features program from the BEDOPS tool 
set"*^ (https://code.google.eom/p/bedops/). 

RNA-Seq analysis. Total RNA was extracted from purified baboon SVZ cells and 
frozen GBM specimens using TRIzol reagent and sequencing libraries were generated 
with Illumina RNA-Seq library kit. Paired-end RNA-deepSeq (76 base pair; >300 
million tag reads) were aligned to hgl9. DESeq was used to normalize raw read 
counts; and Cufflink reports read counts and estimated FPKM (fragments per 
kUobase of exon per million fragments mapped; http://cufflinks.cbcb.umd.edu/faq. 
html#fpkm). Genes with expression values > 1 FPKM were considered for subsequent 
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analyses. Cuffdiff was applied to analyze the differential gene expression between 
GBM and control. Area-proportional Venn Diagrams were generated using the R 
package "Vennerable" (http://r-forge.r-project.org/projects/vennerable/). 

GO, network, and pathway analysis. Gene ontology analysis was performed using 
the DAVID functional analysis tool*^. The Bonferroni, Benjamini, and FDR (false 
discovery rate) were used for multiple test correction. Functional pathway and 
network analysis of bound loci were performed using Ingenuity Pathway Analysis 
(IP A) (Ingenuity® Systems, Redwood City, CA, USA). Loci, known as focus genes, 
were grouped into ontology classes. The Ingenuity Knowledge Base, a repository of 
biological and chemical interactions, was used as a reference set for pathway and 
network inference. Over-represented signaling and metabolic canonical pathways in 
the input data were determined based on two parameters: 1) The ratio of the number 
of molecules from the focus loci set that map to the pathway divided by the total 
number of molecules that map to the canonical pathway, and 2) a p value calculated 
by Fisher's exact test that determines the probability that the association between the 
focus loci and the canonical pathway is explained by chance alone. Network analysis 
used focus loci as "seeds" to infer de novo interaction networks. Direct or indirect 
interactions between focus loci and other molecules were inferred based on 
experimentally observed relationships supported by at least one reference from the 
literature. Additional molecules from the Ingenuity Knowledge Base were added to 
the network to fiU or join smaller networks. The network score was based on the 
hypergeometric distribution and calculated with the right-tailed Fisher's exact test. A 
higher score indicates a lower probability of finding the observed number of focus 
molecules in a given network by chance. 

Co-immunostaining and confocal imaging. £c/[/ tracing: Mice were administered 
with EdU (150 |j,g/body weight) by IP, and subsequently perfused/ fixed with IX 
PBS and 4% Paraformaldehyde 2 hours after EdU injection. Brain sections were 
processed and permeabalized with 0.5% Triton X-100 for 20 minutes at room 
temperature prior to the detection assay of EdU incorporation under manufacture 
instruction (Invitrogen Click-it Edu imaging kit: copper- catalyzed covalent 
reaction between the Edu alkyne and the Alexa Fluor® dye azide). Edu was labeled 
by Alexa Fluor® 488 and imaged under Zeiss LSM 510 Fluorescent Confocal 
microscope. AraC antimitotic treatment was performed as described previously^'. 
Mouse brains were OCT embedded and then 12 ]im frozen sections were 
processed for immunostaining with antibodies against Doublecortin (Cell 
Signaling #4604; 1 : 500), Polysialic Acid-NCAM, clone 2-2b (Millipore 
#MAB5324, Lot# 1966892; 1 : 500), Glial Fibrillary Acidic Protein, clone GA5 
(Millipore #MAB3402, Lot#1993774; 1:500), MASHl(Abcam #ab38556; 1:500), 
H3K4me3 (Millipore #07-473; 1:500), EdU 5-ethynyl-2'-deoxyuridine 
(Invitrogen #C10337), and Vimentin (Sigma V2258;l : 500). Coronal slices of fresh 
baboon forebrain were fixed in 4% paraformaldehyde overnight, cryo-protected 
with 30% sucrose/PBS prior to OCT embedding and sectioning (60 \im). 
Antibodies used for co-immunostaining are described above in addition to Sox2 
(Cell Signaling #3579 and Millipore #AB5603; 1 :500) and Nestin (Abeam 
#AB27952; 1 : 500). Fluorescent labeling with secondary antibodies AlexaFIuor 
488 (Molecular Probes, dilution 1 : 1000) and AlexaFIuor 594 (Molecular Probes 
1 : 1000) were acquired by confocal microscope. Vectashield with DAPI (Vector 
Laboratories Ltd, # H-1200) was used for mounting medium and counter stain. 
Baboon SVZ imaging at 40X focal stacks and 40X/100X 2um single slice were 
taken on a Zeiss 710 two-photon confocal microscope under the control of the 
Zeiss Zen 2011 software (Carl Zeiss Microscopy; Germany). Mouse SVZ imaging 
at 40 X 2um single slice was taken on a Zeiss 510 confocal microscope. 

Western blot analysis of human brain tissue samples. The snap-frozen brain tissue 
samples were homogenized in 1 ml Buffer A (10 mM Tris-HCl pH — 8, 250 mM 
Sucrose, 10 mM MgCl2, 1 mM EGTA) with IX protease inhibitor to extract 
cytoplasmic proteins. The resulting pellets were further homogenized in 1 ml RIPA 
buffer containing IX protease inhibitor to isolate nuclear fraction. The total protein 
concentrations in cytoplasmic and nuclear fractions were quantified by Bradford 
assay (Bio-Rad). Bulk chromatin extraction was performed with histone extraction kit 
from Active Motif For western blot, equal amounts of protein from normal or GBM 
specimens were denatured in final IX SDS stop buffer and subjected to SDS-PAGE for 
western blot analysis with antibodies against H3K4me3 (Millipore #07-473; 1 : 1000), 
H3, H4, MLLl, MLL2, MLL3, and y-tubulin (Sigma #T5326; 1 : 1000). Subsequently, 
HRP- conjugated secondary IgG (Cell Signalling; 1 : 2000) and enhanced 
chemiluminescence kit (ECL plus; GE) were used for detection. Of note, blotting 
membranes were cut into strips to accommodate the molecular weights of MLL, 
H3K4me3, and y-tubulin for antibody incubation separately. Anti-H3K4me3 was 
stripped after ECL detection and the blots were subsequently incubated with 
unmodified H3 and H4 antibodies (internal controls). 

Gene expression analysis. Total RNA was extracted using TRIzol reagent 
(Invitrogen) and cDNA was synthesized by cDNA synthesis kit (Applied Biosystems) 
according to manufacturer's instruction. The expression level was measured by Real- 
Time PGR (RT-PGR) with Power SYBR Green detection on ABI7900HT system (SDS 
2.4, Applied Biosystems). S16 was used as an internal control for normalization (ACt). 
The expression level was measured by AACt (log2 scale) after subtraction of ACt 
between GBM and control. A heatmap was generated using the heatmap2 function in 
R. A hierarchical clustering algorithm with Euclidean distance metric and complete 
linkage was implemented to generate the hierarchical tree. 



Flow cytometry. Isolated SVZ cells were incubated on ice for 10 minutes with anti- 
mouse CD16/CD32 (clone 2.4G2; BD Pharmingen) to block FcRs. Cells were then 
incubated on ice for 30 min with primary antibodies (anti-GFAP, Vimentin, GAD65, 
NeuN, PSA-NCAM, and Doublecortin), and subsequently labeled with Alexa fluor 
488- and PE-conjugated secondary antibodies (BD Biosciences). Primary antibody 
resources: Doublecortin (Cell Signaling #4604; Millipore clone2G5 MABN707), 
Polysialic Acid-NCAM, clone 2-2b (Millipore #MAB5324, Lot# 1966892), Glial 
Fibrillary Acidic Protein, clone GA5 (Millipore #MAB3402, Lot#1993774; Pierce 
#MAl-35376; Abeam clone 2A5 #ab4648), Sox2 (Cell Signaling #3579 and Millipore 
#AB5603; 1 : 500), Nestin (Abeam #AB27952; 1 : 500), NeuN (Millipore MAB377), 
and GAD65 (Abeam #AB11070). For analysis, controls including the positive 
controls stained with each antibody separately, the isotype controls, and the 
unstained cells were used for gate compensation. Flow data was acquired on a LSR-II 
(BD Biosciences) configured with an argon 488 laser with a 505 LP dichroic and 525/ 
50 filter to detect Alexa fluor 488 and a green 510 laser with a 735 LP dichroic and a 
575/26 filter to detect PE. Compensation and data analysis was performed using 
Flowjo software (Tree Star, Inc, Ashland, OR). 
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